کتاب Knowledge Discovery from Multi-Sourced Data (SpringerBriefs in Computer Science)

وضعیت موجودی موجود

مقایسه شود

قیمت قبلی: 2,550,000 ریال

قیمت: 2,150,000 ریال

جلد سخت سیاه و سفید

Product details

Publisher ‏ : ‎ Springer; 1st ed. 2022 edition (June 15, 2022)
Language ‏ : ‎ English
Paperback ‏ : ‎ 95 pages
ISBN-10 ‏ : ‎ 9811918783
ISBN-13 ‏ : ‎ 978-9811918780

This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to “label” or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.

منابع کتاب کتاب Knowledge Discovery from Multi-Sourced Data (SpringerBriefs in Computer Science)

این کتاب به چندین مشکل کشف دانش در داده‌های چند منبعی می‌پردازد که در آن تئوری‌ها، تکنیک‌ها و روش‌ها در پاکسازی داده‌ها، داده کاوی و پردازش زبان طبیعی به طور مصنوعی استفاده می‌شوند. این کتاب عمدتاً بر روی سه مدل داده تمرکز دارد: داده‌های هم شکل چند منبعی، داده‌های ناهمگن چند منبعی و داده‌های متنی. این کتاب بر اساس سه مدل داده، مشکلات کشف دانش شامل کشف حقیقت و کشف واقعیت را بر روی داده‌های چند منبعی از چهار ویژگی مهم: مرتبط بودن، ناسازگاری، پراکندگی و ناهمگونی مورد مطالعه قرار می‌دهد که برای متخصصان و همچنین دانشجویان تحصیلات تکمیلی مفید است. . داده ها، حتی برای توصیف یک شی یا رویداد، می توانند از منابع مختلفی مانند کارگران جمعی و کاربران رسانه های اجتماعی به دست آیند. با این حال، داده ها یا اطلاعات پر سر و صدا اجتناب ناپذیر هستند. در مواجهه با مقیاس دلهره آور داده ها، انتظار اینکه انسان ها «برچسب» کنند یا بگویند کدام منبع داده قابل اعتمادتر است، غیرواقعی است. از این رو، شناسایی اطلاعات قابل اعتماد از چندین منبع اطلاعاتی پر سر و صدا، با اشاره به وظیفه کشف دانش، بسیار مهم است. در حال حاضر، تحقیقات کشف دانش برای داده‌های چند منبعی عمدتاً با دو چالش مواجه است. در سطح ساختاری، در نظر گرفتن ویژگی های مختلف ترکیب داده ها و سناریوهای کاربردی و تعریف مسئله کشف دانش در موارد مختلف ضروری است. در سطح الگوریتم، وظیفه کشف دانش نیاز به در نظر گرفتن سطوح مختلف تضاد اطلاعات و طراحی الگوریتم‌های کارآمد برای استخراج اطلاعات ارزشمندتر با استفاده از سرنخ‌های متعدد دارد.